Technical  Research  Report 

Multiple  Sampling  for  Estimation  on  a  Finite  Horizon 


by  Maben  Rabi,  John  S.  Baras  and  George  Moustakides 


SEIL  TR  2006-2 
(ISR  TR  2006-7) 


ISR  develops,  applies  and  teaches  advanced  methodologies  of  design  and  analysis  to  solve  complex,  hierarchical, 
heterogeneous  and  dynamic  problems  of  engineering  technology  and  systems  for  industry  and  government. 

ISR  is  a  permanent  institute  of  the  University  of  Maryland,  within  the  Glenn  L.  Martin  Institute  of  Technol¬ 
ogy/A.  James  Clark  School  of  Engineering.  It  is  a  National  Science  Foundation  Engineering  Research  Center. 

Web  site  http://www.isr.umd.edu 


Report  Documentation  Page 

Form  Approved 

OMB  No.  0704-0188 

Public  reporting  burden  for  the  collection  of  information  is  estimated  to  average  1  hour  per  response,  including  the  time  for  reviewing  instructions,  searching  existing  data  sources,  gathering  and 
maintaining  the  data  needed,  and  completing  and  reviewing  the  collection  of  information.  Send  comments  regarding  this  burden  estimate  or  any  other  aspect  of  this  collection  of  information, 
including  suggestions  for  reducing  this  burden,  to  Washington  Headquarters  Services,  Directorate  for  Information  Operations  and  Reports,  1215  Jefferson  Davis  Highway,  Suite  1204,  Arlington 

VA  22202-4302.  Respondents  should  be  aware  that  notwithstanding  any  other  provision  of  law,  no  person  shall  be  subject  to  a  penalty  for  failing  to  comply  with  a  collection  of  information  if  it 
does  not  display  a  currently  valid  OMB  control  number. 

1.  REPORT  DATE 

2006 

2.  REPORT  TYPE 

3.  DATES  COVERED 

00-00-2006  to  00-00-2006 

4.  TITLE  AND  SUBTITLE 

5a.  CONTRACT  NUMBER 

Multiple  Sampling  for  Estimation  on  a  Finite  Horizon 

5b.  GRANT  NUMBER 

5c.  PROGRAM  ELEMENT  NUMBER 

6.  AUTHOR(S) 

5d.  PROIECT  NUMBER 

5e.  TASK  NUMBER 

5f.  WORK  UNIT  NUMBER 

7.  PERFORMING  ORGANIZATION  NAME(S)  AND  ADDRESS(ES) 

University  of  Maryland, Institute  for  Systems  Research, Department  of 
Electrical  and  Computer  Engineering, College  Park, MD, 20742 

8.  PERFORMING  ORGANIZATION 

REPORT  NUMBER 

9.  SPONSORING/MONITORING  AGENCY  NAME(S)  AND  ADDRESS(ES) 

10.  SPONSOR/MONITOR'S  ACRONYM(S) 

11.  SPONSOR/MONITOR'S  REPORT 
NUMBER(S) 

12.  DISTRIBUTION/AVAILABILITY  STATEMENT 

Approved  for  public  release;  distribution  unlimited 

13.  SUPPLEMENTARY  NOTES 

14.  ABSTRACT 

15.  SUBIECT  TERMS 

16.  SECURITY  CLASSIFICATION  OF: 

17.  LIMITATION  OF 
ABSTRACT 

18.  NUMBER 

OF  PAGES 

8 

19a.  NAME  OF 
RESPONSIBLE  PERSON 

a.  REPORT 

unclassified 

b.  ABSTRACT 

unclassified 

c.  THIS  PAGE 

unclassified 

Standard  Form  298  (Rev.  8-98) 

Prescribed  by  ANSI  Std  Z39-18 


Multiple  Sampling  for  Estimation  on  a  Finite  Horizon 


Maben  Rabi,  John  S.  Baras,  and  George  Moustakides 


Abstract — We  discuss  some  multiple  sampling  problems 
that  arise  in  real-time  estimation  problems  with  limits  on  the 
number  of  samples.  The  quality  of  estimation  is  measured  by 
an  aggregate  squared  error  over  a  finite  horizon.  We  compare 
the  performances  of  the  best  detereministic,  level-triggered 
and  the  optimal  sampling  schemes.  We  restrict  the  signal  to 
be  either  a  Wiener  process  or  an  Ornstein-Uhlenbeck  process. 
For  the  Wiener  process,  we  provide  closed  form  expressions 
and  series  expansions.  For  the  Ornstein  Uhlenbeck  process, 
we  provide  procedures  for  numerical  computation.  Our  results 
show  that  level-triggered  sampling  is  almost  optimal  when  the 
signal  is  stable. 

I.  Event-triggered  sampling 

In  many  decision  and  control  problems,  we  can  impose  a 
certainty-equivalence  type  separation  into  a  signal  estima¬ 
tion  part  and  a  control  or  decision  design  part.  For  example, 
an  optimal  control  or  a  signal  detection  problem  can  be 
solved  by  certainty-equivalence  policies  which  treat  a  least- 
squares  estimate  of  the  signal  waveform  as  the  true  state.  In 
these  situations,  the  processing  of  available  measurements 
should  be  geared  towards  obtaining  the  best  quality  signal 
estimate.  In  Networked  Control  Systems  [1]  where  the 
sensors  have  only  a  limited  number  of  packets  (samples) 
to  transmit  to  the  supervisor,  the  sampling  design  affects 
the  quality  of  the  signal  estimate. 

Kushner  [2]  has  treated  a  problem  of  picking  a  fixed 
number  of  deterministic  sampling  times  for  a  finite  horizon 
linear  optimal  control  problem.  He  establishes  the  validity 
of  the  separation  principle  and  obtains  closed  form  expres¬ 
sions  for  the  minimum  cost  in  the  scalar  case.  The  collection 
[3]  treats  some  randomized  as  well  as  deterministic  but 
irregular  sampling  schemes  for  smoothing  and  control. 
Cambanis  and  Masry  [4]  have  treated  the  problem  of  pick¬ 
ing  the  best  deterministic  and  random  sampling  schedules 
for  hypothesis  testing  based  on  a  smoothed  estimate.  Their 
random  sampling  schedules  are  however  not  adapted  to  the 
signal  observed  by  the  sensor. 

For  the  problems  treated  in  this  paper,  we  seek  to  char¬ 
acterize  the  performance  gains  provided  by  event-trigerred 
sampling  policies.  Event-triggered  sampling  has  been  re¬ 
ferred  to  as  Febesgue-type  sampling  in  the  control  literature 
[5].  We  solve  a  sampling  design  problem  withn  three  classes 
of  sampling  strategies.  The  sampling  design  is  tailored  to  a 
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signal  filtering  task.  The  design  objective  is  to  minimize 
over  a  finite  horizon  the  distortion  of  a  filter  (real-time 
estimator)  of  the  signal  based  upon  the  stream  of  samples. 
This  minimization  is  performed  with  a  fixed  constraint  on 
the  maximum  number  of  samples  used.  In  [6]  a  related 
problem  in  discrete-time  is  treated. 

II.  Sampling  the  Wiener  process 
For  the  signal  to  be  estimated  (the  state  process),  we  will 
assume  the  model 

dxt  =  dWt ,  t  £  [0,7],  (1) 

with  xQ  =  0  and  VV)  being  a  standard  Brownian  motion.  A 
sensor  has  perfect  observations  of  the  state  and  transmits 
at  times  it  chooses,  current  samples  of  the  state  process. 
The  sensor  is  allowed  at  generate  at  most  N  samples  to 
be  transmitted  to  a  supervisor.  The  sampling  times  S  = 
{t1,...,tjv}  have  to  be  an  increasing  sequence  of  stopping 
times  with  respect  to  the  x-process.  They  also  have  to  lie 
within  the  interval  [0,7’].  Based  on  the  samples  and  the 
sample  times,  the  least-squares  estimate  for  the  supervisor 
x,  is  given  by  [7]: 

!0  if  0  <  t  <  Tj , 
xXj  if  x i<t<  xi+1  <  xN,  (2) 

Xtn  if  xN<t<T. 

The  quality  of  this  estimate  is  measured  by  the  aggregate 
squared  error  distortion: 


We  will  consider  three  sampling  strategies  and  characterize 
their  performance.  The  strategies  are: 

Deterministic  sampling:  The  sampling  sequence  S  is 
chosen  apriori  and  hence  independent  of  the  signal 
trajectory.  It  is  chosen  to  minimize  the  expected 
distortion  J.  In  this  scheme,  the  supervisor  too 
knows  in  advance  when  the  sensor  will  generate 
and  transmit  samples. 

Level-triggered  sampling:  The  sensor  chooses  the  sam¬ 
pling  times  based  on  times  the  error  signal  xt  —xt 
crosses  chosen  thresholds.  The  actual  sampling 
times  are  the  lesser  of  these  threshold  times  and 
the  end  time  T .  The  sampling  times  are  dependent 
on  the  actual  sensor  observations. 


Optimal  sampling:  To  choose  sampling  times,  the  sen¬ 
sor  solves  a  sequence  of  optimal  stopping  time 
problems  and  applies  the  resulting  stopping  rule. 
Here  too,  these  times  are  dependent  on  the  actual 
sensor  observations. 

We  will  first  characterize  the  performance  in  the  single 
sampling  instance  where  N=l. 

A.  Single  Sample  case 

We  can  express  the  distortion  J  as  follows: 

J(  Tj)  =  E[  ' 


Lemma  2.1:  Given  that  x0  =  0  and  the  threshold  rj, 
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Since,  h(xr  ,Trj)  =  0,  the  result  holds.  ■ 

We  can  now  obtain  the  the  pdf  of  Tq  by  obtaining  the  inverse 
Laplace  transform  of  Fq  (s).  We  can  then  write: 


IT  J  T 

Now  notice  that  the  second  term  can  be  written  as  follows 
E 

where  we  have  used  the  strong  Markov  property  of  x,  that 
for  t  >  Tj  we  have  E[x(|  J^T]  =  xT  =  xt.  Because  of  this 
observation  the  performance  measure  J( Tj)  takes  the  form 
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The  above  expression  is  valid  for  all  stopping  times  Tj 
satisfying  0  <  Tj  <  T . 

1 )  Optimum  deterministic  sampling:  When  Tj  is  deter¬ 
ministic,  the  distortion  becomes: 

T 2 

=  Y-(r-Ti)Ti- 
T2 

mmJ(xl)  =  — . 

T 

argminTi7(T1)  =  -. 

2)  Optimum  level-triggered  sampling:  For  a  given  77  >  0, 
let  Tjj  denote  the  following  level-crossing  time: 

Tn{t:\xt\>T)}. 
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2 nj  J  z2cosh(?7v/2z) 

The  last  integral  is  on  a  contour  which  encompasses  the 
entire  left  half  plane.  We  compute  it  through  an  application 
of  the  residue  theorem.  Firstly,  z  =  0  is  not  a  pole  of  the 
integrand  as  it  is  a  double  zero  for  its  numerator.  The  only 
poles  of  the  integrand  come  from  zeroes  of  the  function 
cosh(rj\/2z)-  These  are: 

*=-(2*+!)’^,  *  =  0,1,2,... 

and  they  all  lie  inside  the  left  half  plane.  The  contour 
integral  we  need  to  compute  is  given  through  a  sum  of 
the  residues  at  all  of  these  poles: 
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The  actual  sampling  time  is  given  by 

and  the  distortion  by 
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Using  the  above  result,  the  distortion  can  be  written  as: 
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We  do  not  have  a  closed  form  expression  for  the  pdf  of 
the  stopping  time  Tq.  There  is  a  series  expansion  provided 
in  page  99  of  [8]  which  is  not  directly  useful  to  our  cal¬ 
culations.  Instead,  we  compute  a  version  of  the  generating 
function  of  Tq . 


T2 
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where, 
%2T 


—  _m  J  ke-^n-i  +  {2k+\yx 

(4) 


Because  of  the  above  relations,  we  can  see  that  minimizing 
the  distortion  J  with  respect  to  the  threshold  /]  is  the  same 
as  minimizing  it  with  respect  to  A: 

T2 

tnf/(T^)  =  inf  —  (j?(A). 

If  A0  minimizes  <p(-),  then  the  optimum  threshold  rj0  is 
given  by: 


%  = 


2\/2A0 


Vt. 


Numerically  evaluating  A0,  we  get  rj0  =  0.939 1  VT  and 
the  corresponding  minimum  distortion  to  be  0.3952(r2/2). 
This  is  about  a  20%  improvement  over  the  minimum 
distortion  of  the  deterministic  sampling  scheme:  0.5(r2/2). 

3)  Optimum  single  sampling:  Now,  we  seek  the  sam¬ 
pling  strategy  that  minimizes  the  distortion.  We  seek  a 
stopping  time  Tj  satisfying  0  <  X1  <  T  and  minimizing: 

=  a  xTi(r  —  tj)  . 

Only  the  second  term  depends  on  the  stopping  time  xx. 
Furthermore,  relaxing  the  constraint: 

Tj  <  T 

does  not  change  the  optimum  or  the  optimizing  policy. 

So,  we  are  interested  in  the  following  optimal  stopping 
problem: 


since,  for  this  choice,  the  difference: 

sM-AT-t)=A^-T^>0, 

becomes  a  perfect  square.  And  we  do  have  equality  for  pairs 
(x,f)  such  that  x2  =  s/3 (T  —  t).  Thus  the  optimal  stopping 
rule  is  given  by: 

T,  =  inf  |? :  x2  >  s/3 (T-f)|. 

The  corresponding  minimum  distortion  is  given  by: 

T2  s/3  T2  1  T2  T 2 

J(x H  =  - ——  —  = - =  —  =  0.366  —  . 

2  l  +  s/3  2  1  +  ^3  2  2 

B.  N-Sample  case 

We  will  now  use  the  results  of  section  II-A  to  characterize 
the  performance  of  the  three  sampling  strategies  when  the 
allowed  number  of  samples  is  more  than  one. 

1)  Deterministic  sampling:  We  will  show  through  in¬ 
duction  that  uniform  sampling  on  the  interval  [0. 7’]  is  the 
optimal  deterministic  choice  of  N  samples 
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given  that  the  initial  value  of  the  signal  is  zero. 

When  the  number  of  samples  permitted  is  A,  the  distor¬ 
tion  takes  the  form: 
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Consider  the  following  candidate  maximum  expected  reward 
function'. 
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If  we  assume  that  the  optimal  choice  of  N  —  1  deterministic 
samples  over  [Tt ,  T2]  is  the  uniform  one: 
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where  A  is  a  constant  to  be  specified  subsequently.  Since 
xt  =  Wt  we  have: 


dg(xt,t)  =  A  |—  [(T  —  t)  +  W2]  dt  + 

+A{[(r-f)+w,2]  *}, 
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the  corresponding  minimum  distortion  becomes: 
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The  minimum  distortion  over  the  set  of  N  sampling  times 
is: 
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If  x  is  any  stopping  time,  we  have: 
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E  [g(xT,  x)  |x0]  =  g(x0 , 0)  =  g(x,  t )  =  A  { l  T2  +  xi  (T)  +  ^  }  . 

Let  us  now  pick  A  such  that  g{xf)  >  x2(T  —  t)  but  also  with 
equality  for  some  family  of  pairs  (xf).  Then 
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the  minimum  being  achieved  for  T,  =  T/(N  +  1 ) .  This 
proves  the  assertion  about  the  optimality  of  uniform  sam¬ 
pling. 

2)  Level  triggered  sampling:  Here,  the  sampling  times 
are  defined  through:  For  i  =  1,2 N 


T0  =  0, 
li  >  o, 

TM!,=illf{f; 
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Like  in  the  single  sample  case,  we  will  show  that  the 
expected  distortion  over  [0,7’]  given  at  most  N  samples  is 
of  the  form 


Let  T ^  be  the  level-crossing  time  of  equation  (3).  Then, 
given  a  positive  real  number  a,  the  following  minimal  cost 
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turns  out  to  be: 


P  (T-zvY 


where  p  >  0  depends  only  on  a. 

We  will  now  prove  this  useful  fact.  Notice  that: 

d  [(r  —t)xf]  =  —x2dt  +  2(T  —t)xtdxt  +  (T  —  t)dt, 

and  that, 
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Thus,  the  cost  (5)  becomes: 

S(V)  =  -^Y 


Then  we  have  the  optimal  cost  (5)  as: 

™n  f(r\)  =  T—  inf  {9(A)  +  (0.5  -  a)  y{X)} . 

In  section(II-A.2),  we  saw  that  the  minimal  distortion  for 
the  level-trigerred  scheme  with  a  single  sample  allowed  is: 

t2  rr  2 

C.—  =0.3952— . 

1  2  2 

Based  on  the  above  discussion,  we  can  define  c.  recursively 
as  follows:  For  k>  2, 

c<:  =  inf  { tp(A )  +  (0.5  -  Ca._  1 )  t//(A ) }  , 
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K  =  arg inf  { <P(A)  +  (0-5  -  ck_x ) yr(A) } , 
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Pk  = 

The  optimal  set  of  thresholds  are  given  by: 

Pk  =  PN~k+ 1  —  Vf 

3 )  Optimal  multiple  sampling :  Exactly  like  in  the  previ¬ 
ous  discussion  on  multiple  level-triggered  sampling,  we  will 
obtain  a  parametric  expression  for  the  minimal  expected 
distortion  given  at  most  k  samples. 

Analogous  to  equation  (5),  consider  the  stopping  cost: 
r  r*AT 
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where  a  >  0  is  a  given  constant.  We  can  rewrite  this  as: 

1 
2 

Note  that  there  is  no  change  in  optimality  by  permitting  t 
to  take  values  bigger  than  T.  In  fact  the  optimal  T  even  with 
this  relaxation  will  a.s.  be  less  than  T.  Like  in  the  single 
sample  case,  let  us  pay  attention  to  the  part  of  the  above 
expression  which  depends  on  T  and  define  the  following 
optimal  stopping  problem: 
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minE 


-I  ^  —  a  1  E 


(T~znY 


1 2 


The  above  expression  is  convenient  because,  following 
section(II-A.2),  we  can  rewrite  this  in  terms  of  T  and  A 
(4)  alone.  We  have: 


£/(I?)  =  — 


<pW  +  (  \  ~a)  vW 


Consider  the  candidate  maximum  expected  reward  function: 

g(x,t)  =A  j(7’-f)2  +  2x2(7’-f)  +  y 

where  A  is  a  constant  chosen  such  that  g(x,t )  —2 x2(T  —  t)  — 
(1  —  Oc)(T  —  t)2  becomes  a  perfect  square.  The  only  possible 
value  for  A  then  is: 


where,  y/  is  defined  through: 

V/(A)  = 

16  ,  ke~(2k+P22i  —  1  +  (2k  +  1)2A  —  (1/2) (2k 4-  l)4x2 


(5  +  ct)  —  ^/(5  +  a)“  —  24 


sp2(-o* 


A=0 


(2k+iy 


Then  the  optimal  stopping  time  is  given  by: 

T*  =  inf  {f :  g(xt,t)  <  2 xf(T  —  t)  +  (1  —  a)(T  —  f)2}  , 
/3(A-l  +  a) 


=  inf  <  t :  xf  > 


<T-t ) 


(6) 


and  the  corresponding  optimal  distortion  J?  being 


A.  Optimum  deterministic  sampling 


=  (1-A)y 


Now,  we  obtain  the  explicit  stopping  rules  and  the  cor¬ 
responding  minimal  distortions  for  different  values  of  the 
sample  budget  N  by  defining  recursively  kn  .  y v : 


(5  +  kn-i)  —  \J (5  +  %_ i)2  ^24 


7n  = 


1  -  K., 


The  (k  +  l)th  sampling  time  is  chosen  as: 


We  will  show  through  induction  that  uniform  sampling 
on  the  interval  [0,  T ]  is  the  optimal  deterministic  choice  of 
N  samples 

|  0  <  t i<T,  T,  >  X-_ ,  for  i=  1,2, ... ,N} 

given  that  the  initial  value  of  the  signal  is  zero. 

When  the  number  of  samples  permitted  is  N,  the  distor¬ 
tion  takes  the  form: 

r  T* 

"^[o.r]  ({ Ti  ’  >  •  •  •  ^n})  =  ixs~xs)  ds 

/^2  /*r 

(xs  —  xs)2ds  +  ■  ■  .  T  /  (xs  —  xs)2ds. 

1  ^  ^7V 


bt+t 


inf  1 f : 


—  Yn-1c+  1 


If  we  assume  that  the  optimal  choice  of  iV  —  1  deterministic 
samples  over  [7] .  T2  \  is  the  uniform  one: 


4)  Comparisons:  We  now  list  a  numerical  comparison 
of  the  aggregate  filtering  distortions  incurred  by  the  three 
sampling  strategies  on  the  same  time  interval  [0.  T],  We  ob¬ 
tained  the  distortions  for  all  sampling  strategies  as  product 
of  T2/ 2  and  a  positive  coefficient.  The  numbers  listed  in 
the  table  are  these  coefficients. 


N 

1 

2 

3 

4 

Deterministic 

0.5 

0.333 

0.25 

0.2 

Level-triggered 

0.3953 

0.3471 

0.3219 

0.3078 

Optimal 

0.3660 

0.2059 

0.1388 

0.1032 

III.  Sampling  an  Ornstein  Uhlenbeck  process 


Now  we  will  turn  to  the  case  when  the  signal  is  an 
Ornstein  Uhlenbeck  process: 

dxt  =  axrdt  +  dWt,  f  €  [0,  T],  (8) 

with  x0  =  0  and  Wt  being  a  standard  Brownian  motion. 

Again,  the  sampling  times  S  =  {Tj , . . . ,  T^}  have  to  be  an 
increasing  sequence  of  stopping  times  with  respect  to  the 
x-process.  They  also  have  to  lie  within  the  interval  [0,7']. 
Based  on  the  samples  and  the  sample  times,  the  supervisor 
maintains  an  estimate  waveform  x,  given  by 

{0  if  0  <  t  <  Tj , 

xT.ea^~x^  if  x t<t<  xi+1  <  xN,  (9) 
xTjvea^“Tw)  if  XN  <t  <T. 

The  quality  of  this  estimate  is  measured  by  the  aggregate 
squared  error  distortion: 


{Ti,T2,...Ta,_|  }  —  jVj  +i 


t2~t, 


N 


i=l,2,...,N-l}, 


the  corresponding  minimum  distortion  becomes: 


4a2 


The  minimum  distortion  over  the  set  of  N  sampling  times 
is: 


min  J[0yT]({xv  %,...%}) 


=  min 

T , 


Kf‘ 


x,s  —xs)2ds  +  min 

{r2,r2,...rN} 


[t j,r]  ({'rl’T2’-"'rA})|  > 


4a2 
N+  1 
4a2 


i 

2  a 
1 


the  minimum  being  achieved  for  Tj  =T/(N+  1).  Thus,  we 
have  the  uniform  sampling  scheme  being  the  optimal  one 
here  too. 


B.  Optimum  Level-triggered  sampling 

Let  us  first  address  the  single  sample  case.  The  perfor¬ 
mance  measure  then  takes  the  form 

</0i)  =  E  [  '*7+  [  ( Xt-xt)2dt 

JO 

rT  rT  rT 

=  E  /  x2  —  2  /  x,xtdt+  /  ( x,)2dt  . 

JO  J  Tj  J  Tj 

Now  notice  that  the  second  term  can  be  written  as  follows 


E 


J  xtxtdt  =E  J  E[x,| ^X[\x,dt  =E  J  ( xt ) 


'  i it 


where  we  have  used  the  strong  Markov  property  of  xt  that 
for  t  >  Tj  we  have  E  [xt  \  ,5?T|  ]  =xTe~a^“Ti)  =xt.  Because  of 


this  observation  the  performance  measure  ^/(Tj)  takes  the  denoted  Gv (T  -  7] ) .  Then,  the  1V+1  sampling  problem 

can  be  solved  as  follows.  Let  UjX+}(x,t)  satisfy  the  PDE: 


form 


£/K)=  e 

elaT  -\-2aT 


rT  rT 

/  xj  (It  —  /  ( xt)2dt 
J  0  Jt, 


T 

1 

-E 


4a2 

' e2aT  _i_2 aT 
A{aT)~ 


-E 


e2a(r-r1)_1 
1  2  a 

X2  e2(aT)(\-xJT)  _  j 


e“M-l+2d 

4d2 


-E 


r  2(ar) 

e2a(l-fj)  _  j 


‘1  2d 


where 


t  x, 

t  =  — ;  a  —  aT\  x.T=—'=. 

t  '  Vr 


4a2 

(10)  -|e 


We  have  x  satisfying  the  following  SDE: 

<7x-  =  —axjdt  +  dwf. 


Tj Uxx  +  axf/Y  +  (/,  —  0, 

along  with  the  boundary  and  initial  conditions: 

=  UN+l{t]vt)  =  GN(T-t)  for  t  €  [0,7’], 
|f/A,+1(x,7’)  =0  for  x  €  [— rjj ,  rjJ. 

Then  the  distortion  we  are  seeeking  to  minimize  over  r/1  is 
given  by: 

a„2  V  '2a 


?M-1)_1  +  4L(eM-1)_1)_^(r_Ti) 

+  E[Gw(T-t1)] 


=  ^2  -  1)  -  ^  (e2aT  (1  +2«G1(0,0))  -  1)  -t/w+1. 


This  suggests  that,  without  loss  of  generality,  we  can  limit 
ourselves  to  the  normalized  case  T  =  1  since  the  case  T  /  1 
can  be  reduced  to  it  by  using  the  transformations  in  (10).  In 
fact,  we  can  solve  the  multiple  sampling  problem  on  [0,7"] 
without  loss  of  generality. 

We  carry  over  the  definitions  for  threshold  sampling  times 
from  section  II-B.2  We  do  not  have  series  expansions  like 
for  the  case  of  the  Wiener  process.  Instead  we  have  a 
computational  proceudre  that  involves  solving  a  PDE  initial 
and  boundary  value  problem.  We  have  a  nested  sequence  of 
optimization  problems.  The  choice  at  each  stage  being  the 
non-zero  level  fy.  For  N  =\,  the  distortion  corresponding 
to  a  chosen  T]1  is  given  by: 


(«2"r-l)-f7-^E 


?2a{T- Tj)  _  j 


=  h  (c“  -  ■ -  hT~  I  ('“■  (■ -  >)  • 

where  the  function  U]  (x.t)  satisfies  the  PDE: 

—  Uxx  +  axUx  +  Ut  +  e  ~~at  =  0, 

along  with  the  boundary  and  initial  conditions: 

for  t  €  [0,7"], 
for  x€  [-TJuVi]- 


U1(-rh,t)  =  U1(rivt)  =  0 
Ul(x,T)  =  0 


We  choose  the  optimal  T]1  by  computing  the  resultant 
distortion  for  increasing  vlaues  of  r/1  and  stopping  when 
the  cost  stops  decreasing  and  strats  increasing.  Note  that  the 
solution  U (0,f)  to  the  pde  furnishes  us  with  the  performance 
of  the  r;  j -triggered  sampling  over  \t,T].  We  will  use  this  to 
solve  the  multiple  sampling  problem. 

Let  the  optimal  policy  of  choosing  N  levels  for  sampling 
over  [7’j, 7"]  be  given  where  0  <  7]  <  7’.  Let  the  resulting 
distortion  be  also  known  as  a  function  of  7] .  Let  this  known 
distortion  over  [7] ,  7"]  given  N  leveltriggered  samples  be 


We  choose  the  optimal  T]l  by  computing  the  resultant 
distortion  for  increasing  values  of  rjt  and  stopping  when 
the  distortion  stops  decreasing. 


C.  Optimal  Sampling 

We  do  not  have  analytic  expressions  for  the  minimum 
distortion  like  in  the  Brownian  motion  case.  We  have  a 
numerical  computation  of  the  minimum  distortion  by  finely 
discretizing  time  and  solving  the  discrete-time  optimal 
stopping  problems. 

By  discretizing  time,  we  get  random  variables  Xj, . . .  ,xM, 
that  satisfy  the  AR(1)  model  below.  For  1  <  n  <M 


x„  =  eaSxn_  j  +  w„,  wn  ~  0, 


„2a8 


-1 


2  a 


1  <  n  <  M. 


{w,,}  is  an  i.i.d.  Gaussian  sequence. 

Sampling  once  in  discrete  time  means  selecting  a  sample 
xv  from  the  set  of  M+  1  sequentially  available  random 
variable  x0,...,xM,  with  the  help  of  a  stopping  time  V  € 
{0, 1, . . .  ,M}.  We  can  define  the  optimum  cost  to  go  which 
can  be  analyzed  as  follows.  For  n  =  M,M  —  1 , . . . ,  0, 


Vni*) 


sup  E 

n<v<M 


g2a8(M—v) 
2  a 


max 


g2a8(M—n) 

2  a 


1 


)E[V„Vl(Xn+l)lX« 


The  above  equation  provides  a  (backward)  recurrence  re¬ 
lation  for  the  computation  of  the  single  sampling  cost 
function  V,\  (x).  Notice  that  for  values  of  x  for  which  the 
l.h.s.  exceeds  the  r.h.s.  we  stop  and  sample,  otherwise  we 
continue  to  the  next  time  instant.  We  can  prove  by  induction 
that  the  optimum  policy  is  a  time-varying  threshold  one. 
Specifically  for  every  time  n  there  exists  a  threshold  X„ 
such  that  if  |x„|  >  X„  we  sample,  otherwise  we  go  to  the 
next  time  instant.  The  numerical  solution  of  the  recursion 


presents  no  special  difficulty  if  a  <  1 .  If  V)  (x)  is  sampled 
in  x  then  this  function  is  represented  as  a  vector.  In  the 
same  way  we  can  see  that  the  conditional  expectation  is 
reduced  to  a  simple  matrix-vector  product.  Using  this  idea 
we  can  compute  numerically  the  evolution  of  the  threshold 
A,  with  time.  The  minimum  expected  distortion  for  this 
single  sampling  problem  is: 


e2aT  -1-2 aT 
4  a2 


V<>(0). 


For  obtaining  the  solution  to  the  N+ 1 -sampling  problem, 
we  use  the  solution  to  the  iV-sampling  problem.  For  n  = 
M,M—  1,...0, 


v  W  = 


sup  E 

n<v<M 


12  a8(M—v)  i 

,  r2e 

Vy  (UJ  +  Xy 

la 

Xn  —  X 

=  max  <  V;f(0)  +x' 


„2a8(M—n)  _  j 

2  a 


(ID 


VnN+l  (0)  +  E  [y„Vt  (*„+i)  =  x]  }  ■  (12) 


Below  is  a  tabular  comparison  of  the  relative  perfor¬ 
mance: 


N 

1 

2 

3 

4 

a  =  — 10 

Deterministic 

0.0450 

0.0425 

0.0401 

0.0377 

Level-triggered 

0.0403 

0.0377 

0.0357 

0.0341 

Optimal 

0.0369 

0.0302 

0.0254 

0.0219 

a  =  —5 

Deterministic 

0.0801 

0.0711 

0.0633 

0.0568 

Level-triggered 

0.0716 

0.0650 

0.0601 

0.0576 

Optimal 

0.0616 

0.0462 

0.0366 

0.0301 

a  =  —  1 
Deterministic 

0.1839 

0.1351 

0.1065 

0.0879 

Level-triggered 

0.2576 

0.2059 

0.1725 

0.1553 

Optimal 

0.1308 

0.0802 

0.0562 

0.0425 

a  =  1 

Deterministic 

0.3591 

0.2108 

0.1487 

0.1148 

Level-triggered 

0.2714 

- 

- 

- 

Optimal 

0.2613 

0.1182 

0.0630 

0.0345 

IV.  Concluding  remarks 


We  have  furnished  methods  to  obtain  good  sampling 
policies  for  the  finite  horizon  filtering  problem.  When 
the  signal  to  be  kept  track  of  is  a  Wiener  process,  we 
have  analytic  solutions.  When  the  signal  is  an  Ornstein- 
Uhlenbeck  process,  we  have  provided  computational  recipes 
to  determine  the  best  sampling  policies  and  their  perfor¬ 
mance. 

We  will  report  elsewhere  on  the  solution  to  the  case  when 
the  sensor  has  access  only  to  noisy  observations  of  the 
signal  instead  of  perfect  observations.  This  leads  us  to  some 
simple  multi-sensor  sampling  and  filtering  problems  which 
can  be  solved  in  the  same  way. 


The  case  where  the  samples  are  not  reliably  transmitted 
but  can  be  lost  in  transmission  is  computationally  more 
involved.  There,  the  relative  performances  of  the  three 
sampling  strategies  is  unknown.  However,  in  principle,  the 
best  policies  and  their  performances  can  be  computed  using 
nested  optimization  routines  like  we  have  used  in  this  paper. 

Another  set  of  unanswered  questions  involve  the  perfor¬ 
mance  of  these  sampling  policies  when  the  actual  objective 
is  not  filtering  but  control  or  signal  detection  based  on  the 
samples.  It  will  be  very  useful  to  know  the  extent  to  which 
the  overall  performance  is  diminished  by  using  sampling 
designs  that  achieve  merely  good  filtering  performance. 
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